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ABSTRACT 

Three experiments were conducted in order to 
investigate teacher- and experimenter- bias effects on children's 
learning and performance. Teacher-bias effects on children's Stanford 
Achievement Test (SAT) performance were assessed in a one and a half 
year longitudinal study in two second- and two fourth-grade 
classrooms. The teacher-bias manipulation had no significant 
relationship to SAT performance on any of the testings, but teacher 
ranking was strongly and consistently related to SAT performance. The 
results indicate teachers are good predictors of student academic 
potential but do not bias students' achievement test performance. Two 
experimental studies were also conducted in order to examine 
developmental trends and mode of inducing bias on children's 
susceptibility to experimenter bias. College students were trained to 
be experimenters in a simple motor performance task, marble dropping, 
in which experimenter-bias effects had been demonstrated previously. 
The major finding was a significant Age x Bias Condition X Sex of 
Subject (S) interaction indicating a general trend for older Ss to be 
more influenced by biasing effects of Es than younger Sr. The 
significant effects were unclear with respect to node of biasing the 
experimenters. The test was disguised as a test to predict academic 
potential. (Author) 
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Introduction 

The central problem under investigation was the effect of adult/ 
teacher expectations on children's learning and performance. Three 
studies were conducted to provide information relevant to the follow- 
ing three questions: a) Are teacher-bias or teacher-expectancy effects 
observable in measures of academic performance? b) Are these effects 
observable only when induced in the teacher or experimenter by the 
principal investigator as opposed to being self-gene»^ated by the 
adult? c) Are there developmental trends in susceptibility to adult 
expectancy effects? 

A constructive context within which to view this research is 
to conceive of the classroom or experimental setting as a social 
interaction situation in which the adult and the child are con- 
tinually interacting with each other. There is considerable evi- 
dence that during this interaction teachers treat groups of stu- 
dents differently (e.g., Davidson, 1972; Good & Brophy, 1970; 
Schwebel & Cherlin, 1972) and that the manner in which a teacher 
reacts toward a student influences the student's self-concept 
and classroom performance. Meyer and Thompson (1956) have shown 
that teacher disapproval is distributed unequally between boys 
and girls and furthermore that both boys and girls are aware that 
boys are recipients of more disapproval than are girls. More 
recently, Davidson (1972) has shown both sex differences and social 
class differences in students' reactions to teacher reinforcement. 
Davidson and Lang (1960) have reported that students' self-concept 
is directly related to teachers' use of approval and disapproval. 
These investigators have also provided evidence indicating that the 
more positively the student views the teacher's feelings toward 
him the better his academic achievement. Finally, Staines (1958) 
has shown that by specific behaviors toward pupils with low self- 
concepts the teacher can effectively raise the students' rated 
self-concept. It is apparent, then, that the student is aware of 
what the teacher thinks of his abilities and personality and that 
this awareness plays an important role in the child's developing 
self-concept. It is not difficult to imagine that the student also 
has some perception of the teacher's views regarding his academic 
potential. The crucial question is whether this may affect the 
student's performance in the academic setting. Similar evidence 
(e.g., Rosenthal, 1966; Friedman, 1967) indicates analogous processes 
may operate in the experimental situation. 

Research bearing on these issues falls into three categories. 
First, there is a body of research dealing with experimenter bias 
effects in psychological research. This research has been thoroughly 
reviewed by Rosenthal (1966, 1968a, 1969a,b), Friedman (1967) and 
Barber and Silver (1968a, 1968b). The literature in this area is a 
clear demonstration that under certain conditions experimenters may 
intentionally or unintentionally bias the performance of adults 
(Rosenthal, 1966; Barber & Silver, 1968a, 1968b) or children (Dusek, 
1971; 1972) in psychological experiments. Second, there are several 
studies in which expectancy effects and self-fulfilling prophecies 
have been investigated in tutoring situations involving student 
teachers (e.g., Beez, 1968; Rubovits & Maehr, 1972). Third, there are 
a number of studies in which teacher expectancy effects in elementary 
school classrooms, or other classroom situations, have been investigated 
(e.g., Rosenthal & Jacobson, 1968; Claiborn, 1969; Dusek & O'Connell, 1973). 
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In the following literature review the term "teacher-bias or 
exper1menter-bias effects" will refer to significant effects due to 
teacher/experimenter differential expectations for children's per- 
formance, but only in the case involving induction of expectancies 
by a principal investigator. That 1s, bias effects will be due to 
a manipulation, or attempted manipulation, of expectancies by an 
investigator. Such effects are analagous to the effects reported by 
Rosenthal (1966) and Rosenthal and Jacobson (1968) and are bias in 
the sense that the adult has differential expectations regarding 
the performance of children who are equivalent on some objective 
measure. The term "expectancy effects" will refer to significant 
effects due to the adults' own, self-generated expectations regarding 
children's performance. In this case, it Is the adults' own expec- 
tancy, formed however adults form it, which Is related to children's 
performance. This distinction will prove critical in interpreting 
the findings reported below. Each of these areas of research will 
be reviewed below. 



Literature Review 
In-clas'^room studies of teacher bias effects . 

A number of studies have investigated teacher bias effects in 
elementary school classrooms. That teachers do form expectations 
regarding specific children who wHl or will not perform well in 
certain academic subjects has been clearly shown by Rist (1970). Rist 
found that a kindergarten teacher placed children into groups on the 
basis of her subjective impressions regarding their likelihood to 
succeed in the academic situation. Once placed into one of the three 
levels of this caste system it was nearly impossible for a child to 
change his classification from one of the lower groups Into the top 
group. Furthermore, Rist found that a child's group placement by the 
kindergarten teacher was maintained by the first- and seccnd-grade 
teachers. At the second-grade level the teacher used reading level to 
assign the groups. Rist, however, notes that the use of "objective" 
data may simply have been a rationalization for placing the children 
into groups on the basis of other reasons, such as neatness, social 
class, etc. Unfortunately, Rist did not systematically examine the 
effects of teacher expectations on school artd achievement test perfor- 
mance. Hence, the extent to which teacher expectations biased the 
children's education, if at all. is not known. 

The first study of teacher bias effects was conducted by Rosenthal 
and Jacobson (1968) in an elementary school serving primarily a lower 
social class neighborhood. At the beginning of the school year all Ihe 
children in grades 1-6 were given an IQ test, Flanagan's (1960) Test of 
General Ability (TOGA), disguised as a test to predict "academic blooming". 
The test was given again at the middle and end of the school year. Within 
each of the 18 classrooms approximately 20% of the children were randomly 
chosen to form an experimental group. The names of these students were 
given to their teachers and 1t was explained that these children had 
scored on the test in such a manner as to predict that they would show 
large gains in intellectual ability during the school year. Across all 
classrooms the year-end test scores showed an approximately 4 point 
advantage for the children in the experimental group. However, at the 
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first- and sacond-grade levels the children in the experimental 
group showed gains of as much as 15 IQ points more than the 
children in the control group. In terms of school performance, 
the children in the experimental group showed a significantly 
better gain than the children in the control group only for 
reading, one of the 11 school grades considered. On the basis 
of these data Rosenthal and Jacobson (1968) concluded that the 
children in the experimental group gained more than children 1n 
the control group during the course of the academic year because 
the teachers expected a higher level of performance from them. 

R. L. Thorndike (1968, 1969) has criticized the Rosenthal and 
Jacobson research on several grounds, including faulty pre- and 
post-test data and the suggestion that students may not have 
attempted a large number of ftems. thus lowering their IQ scores 
and. essentially, making the test a poor measure. Jensen (1969) 
has attacked the Rosenthal and Jacobson research on three grounds: 
a) the same IQ test was used for the pre- and post-tests; b) the 
teachers administered the tests; c) the child was the unit of 
analysis instead of the classroom. In addition. Claiborn (1969) 
has argued that many of tie findings are unconvincing since they 
did not ream standard lovels of significance and were not predicted 
prior to the investigation. Rosenthal (1968. 1969b. 1973) has 
convincingly replied to many of these criticisms. However, other 
criticisms of the Rosenthal and Jacobson research remain (e.g.. 
see Elashoff & Snow, 1970. 1971; Rosenthal & Rubin, 1971). As 
Snow (1969) has argued, "Rosenthal and Jacobson will have made an 
important contribution if their work prompts others to do sound 
research in this area. But their study has not come close to provi- 
ding adequate demonstration of the phenomenon or understanding of 
its process." At the present time, no other conclusion regarding this 
research has stimulated a number of studies exploring various aspects 
of teacher bias, or self-fulfilling prophecy, effects, however. It 
is these which shall be reviewed next. 

Claiborn (1969) attempted to replicate the Rosenthal and Jacobson 
research. Not only were some of the teachers led to believe certain stu- 
dents would show much progress intellectually during the remainder of 
the year, but some of the classrooms were observed in order to obtain 
data concerning the student-teacher interactions. The results indicated 
no differential gain in IQ between the experimental and control children. 
Furthermore, there was no indication that teachers behaved differently 
toward the control and experimental children. Since the biasing state- 
ments were introduced well into the school year (Spring semester) and 
since the length of the study was only 2 months, the results are difficult 
to interpret. Perhaps the teachers had their own well -formed opinions ot 
the students' potential and the opinion of an "outsider" was just not seen 
as valid. Perhaps, too, the two month interval between biasing and post- 
testing was not long enough for the effect of teacher expectancy to become 
critical to the students' performance. 
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Several other studies have attempted to replicate the findings 
of Rosenthal and Jacobson with varying degrees of success. Evans 
and Rosenthal (1969) found that for Kindergarten through fifth grades 
the boys in IYa experimental group gained more IQ points in the 
reasoning subtest than the ooys in the control group, with the reverse 
holding for the girls. There were no effects for Verbal IQ or Total 
IQ scores. Anderson and Rosenthal (1968) report a failure to replicate 
with familial retarded boys. Meichenbaum, Bowers, and Ross (i9o9J, 
using female adolescent offenders, reported that the "bloomers showed 
more improvement on objective but not on subjective tests than did the 
control group. This study is of particular interest since it focused on 
academic performance rather than IQ. Furthermore the classroom observa- 
tions revealed that the "bloomers" significantly improved in terms of 
appropriate behavior more than did the control group. 

Studies involving the tutoring situation . 

One of the eaniest attempts to replicate Rosenthal and Jacobson's 
findings was conducted by Beez (1968). Sixty graduate students in ed- 
ucation each taught a symbol learning task to one child in a summer 
Headstort program. Half the teachers were told the child they would 
teach was culturally deprived and would probably have trouble adjusting 
to school and doing well in school. Half the teachers were told the 
reverse about their children, i.e., that the child should be able to 
adjust and do well in school despite cultural deprivation. Each 
teacher was given 10 minutes to teach the child the "leanings of 20 
different symbols. Compared with teachers of "non-problem children, 
teachers of "problem" children tried to teach significantly fewer 
symbols (5.7 vs. 10.4), spent more time on '"^n-teaching activ ties, 
rated the children as lower on achieverient, intellectual abil ty and 
social competency, and generally thought the task too difficult for 
the children. The children taught by these teachers learned fewer 
symbols than the children taught by teachers expecting the children 
to do well (3.1 vs 5.9). Moreover. 77% of the children alleged to 
have better intellectual prospects learned five or more symbols but 
only 13% of the children alleged to have poorer prospects learned 
five or more symbols. These results suggest that teacher bias, in this 
cas*^ induced by the experimenters, may be translated into subtle teacn- 
ing'style effects related to the child's actual 1-arning. Inj^^er 
words, teachers may both teach differently and tr. to teach different 
amounts to children depending upon their experiences regarding the 
child's performance. 

This possibility was specifically investigated by Rubovits and Maehr 
(1971), who used a micro-teaching situation. Each of 26 female under- 
qraduaies taught a lesson on television to four sixth and seventh graders, 
The teachers were given a folder with false IQ and school program infor- 
mation for each of the children. The information indicated that two of 
the children were from the school s gifted program and two were from a 
regular track. The teaching session lasted from 43 to 60 minutes. An 
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observer recorded the student-teacher interactions. The teachers 
did not differ in the amount of attention given to allegedly "gifted" 
and "nongifted" children but the pattern of attention and praise given 
the children 1n the two groups did differ. The teachers requested 
more information from "gifted" than "nongifted" children and praised 
the statements of "gifted" children more often than the statements of 
"nongifted" children. Rubovits and Maehr have discussed the implied^ 
tions of these findings for academic achievement of "gifted" children. 
Briefly, they argue that this type of interaction may allow the 
"gifted" child to better clat^ify his thoughts regarding the material 
being taught and thereby allow him to learn more than the "nongifted" 
child. 

Carter (1969) also found differences in the teaching styles of 
tutors who were given either positive or negative information about • 
students. Carter used a task and procedure similar to Beez (1968) 
with tutors who were either internal or external locus of control 
individuals. For external locus of control tutors there were no 
bias effects. For internal locus of control tutors, however, tutors 
given positive information attempted to teach more material than tutors 
given negative information. Although they did not replicate Carter's 
findings. Panda and Guskin (1970) did report that tutors given posi- 
tive information rated the students higher in academic achievement 
and social competency than tutors given negative information, a 
replication of Beez's (1968) findings. 

Two other studies deserve brief mention here although neither 
involved tutoring. Pitt (1956) investigated the effects of teachers' 
knowledge or incorrect knowledge of students' IQ scores on the children's 
school achievement, attitudes toward school, teachers, and school work, 
teachers' ratings of student conduct, and teachers' attitudes and 
behaviors in relation to students. Teachers' knowledge or incorrect 
knowledge did not relate to achievement test performance, students' 
expressed views about various aspects of the school situation, or teachers' 
ratings of students' conduct or effort. The information given the 
teachers did. however, cause them to increase pressure on bright students 
and lower expectations for students with lower intellectual ability. This 
latter finding represents a teacher bias effect but it apparently had no 
effect on the students' learning or achievement test performance. Cahen 
(1966) had 256 college students in elementary education score subjective 
tests after being given various amounts of Information about the students 
who purportedly took the tests. Cahen (1966) reported that teachers' 
scoring of the tests was biased depending on the information given. 

These studies, in conjunction with similar work done by Flowers (1966). 
Jose and Cody (1971). and Fleming and Anttonen (1971). lead to the conclu- 
sion that false test or IQ scores, or differential track assignments do 
not relate to students' actual achievement. That is. there is no measurable 
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teacher-bias effect in the children's learning or test performance. 
These studies do. however, clearly demonstrate effects of such 
information on teachers' attitudes toward the children and in the 
way they treat different groups of children. In one study (Deez, 1968) 
the students die, indeed, learn different amounts of material. These 
studies generally suggest the valuc of examining teachar bias effects 
in real classroom situations. 

Studies of experimenter-bias (E- bias ) effect s. 

Rosenthal (1966. 1969b)has recently summarized the research 
dealing with E-bias effects. By far. the majority of these studies 
have been done with the Person-Perception Task, although verbal 
conditioning, simple motor tasks, as well as other tasks hive 
been used in these investigations. The general procedure is to 
tell a group os E^s to expect to obtain one type of data from 
certain subjects (Ss) (e.g.. high anxious 3s) and to expect ano- 
ther type of data from other Ss (e.g.. low anxious Ss). Unknown 
to the Es. these Ss have been "random1y assigned to groups, e.g.. 
high or^ow anxious. The data obtained by these E^s from the 
differently-designated Ss is then compared to see if Es obtained 
the expected data. For example, in the Person-Perception Task. 
S simply rates photographs of individuals on the degree of success 
or failure, on a scale ranging from extreme success (+10) to 
extreme failure (-10). which S thinks that the pictured individual 
has just experienced. Of coarse, the photographs have previously 
been rated by large groups of Ss as being expressive or neither 
success nor failure (mean photo rating Is zero for each photo). The 
Es are biased to expect relatively high (+5) or relatively low (-5) 
ratings from certain Ss. A total of at least 57 experiments using 
essentially the same procedures and subject pooulations have been 
conducted by various investigators (Rosenthal. 196*). These studies 
have shewn that Es tend to obtain the data they have been led to 
expect. 

Since extrapolations from the E-bias literature are drawn with 
reference to general experimental research in psychology and education, 
it is critical that the ecological validity of the expectancy manip- 
ulation be established. Few principal investigators are told to 
expect certain data. Rather, they i-^tuit their expectancies from 
large research literatures. To the extent that ^-expectancy effects 
are obtained only when E is told what to expect and are not obtained 
when E postulates some outcome from his own knowledge and intuition, 
the extrapolations from this research to general experimental psychology 
are likely to be incomplete or faulty. Several studies have Investigated 
whether the method of inducing bias into E is related to E-bias effects. 
Induction has been loosely defined and Includes methods such as allowing 
E to generate his own preductions. telling E what to expect, and asking E 
to intentionally try to bias S. 

Marcia (1961) asked half the Es to estimate the magnitude of the 
scores they expected to obtain from Ss in the Person Perception Task. 
The other half of the Es were told to expect certain scores from their 
Ss. Marcia found no evidence of E-bias effects for either group of 
Es. In a second study (Marwit & Harcia. 1967) half of the Es were told 
to expect either many or few responses 1n the Holtzman Inkblot Test and 
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half were askeo to predict whether they would obtain many 
or few responses. Both groups of Es obtained data consistent 
with their expectations, with no difference in magnitude between 
the two types of biased Is. 

Johnson and Adair (1970). using the Word Association Task, 
manipulated the involvement of the Principal Investigator (PI) 
in the conducting of the experiment by having the PI visit half 
the Es during the testing and express concern for the findings 
of tFe experiment. The. other half of the Es were visited equally 
often but no mention ofpe results was made. The overall expec- 
tancy effect wus significant, i.e., there were greater latencies 
to exam-related words for Ss in the Exam Condition, but this effect 
did not interact with invoTvement of the PI. It may be that just 
by visiting £ is made more aware of the results and the import- 
ance of the results to the PI, thus eliminating any differential 
effect of the Pi's verbalizations. 

A study recently completed by Johnson (1970) included a group 
of Es who were completely informed of the purposes of the experi- 
ment and the importance of the E-bias research. These Es were 
asked to Intentionally bias S by any method except telling S. 
to perform in any certain manner. The results indicated that this 
group of is did not obtain data which was biased. It may be the 
case that if E actively or knowingly attempts to bias S there 
is no bias efTect. Perhaps Ss become aware that E is attempting 
to bias their responses and Hb not commy for fear of "cheating 
the advancement of Science (Orne, 1962). 

It is obvious that at this point very little is known about the 
effects of the method of inducing bias in Es. It may be the case 
that self-generated expectancies produce no E-b1as effects. Further- 
more, it may be that it is impossible for I to intentionally bias 
Ss. It is unfortunate that the E^s in these various studies were not 
questioned regarding the reasons they expected certain data, i.e., 
the reasons they held certain expectancies. A recent study by Dusek 
(1971) found that the Es not obtaining any bias effect in the marble 
dropping task expected the data consistent with the induced bias, as 
did the Es who did obtain bias effects. However, tne reasons given 
for why the E held the expectancy he stated differed for the two 
groups of EsT The £s who did obtain data biased toward their expec- 
tations stated some reason r dilated to their experience with children. 
It may be that if E has had some experience which he may bring to bear 
on the task he wilT be better able to rationally predict certain 
outcomes and will be more willing to hold a given belief than might 
otherwise be the case. If this is true, then the more ecologically 
valid question of whether behavioral scientists bias their research 
would be better answered by providing ^s with enough data and experience 



to be able to make some rational prediction regarding Ss' performance. 
The Es would then have some more concrete basis for making predictions 
than was the case in the above research and may have a greater 

stake in the results, much as the professional researcher. It would 
then be possible to have the Es test Ss and investigate, in an analogue 
manner, the question of bias Tn the present research methods used in 
psychological and educational research. 

Summary of 1 i terature review . 

A perusal of the above literature review reveals a number of 
interesting findings with respect to teacher- and experimenter- 
bias effects. It is clear from the literature review that teacher 
bias effects may eyist, particularly in the tutoring situation 
(e.g., Beez, 1958) or in the instance where teachers teach only 
a few students (e.g., Meichenbaum, Bowers, & Ross, 1969). The 
research in which the elementary school classroom has been employed 
is less clear with respect to this issue. Evidence indicating teachers 
bias students' learning or intellectual ability is either weak or 
nonexistent in the studies conducted up to this time. Evidence 
indicating that teachers do form expectations is abundunt (e.g., Brophy 
& Good, 1970; Rist, 1970), as is evidence that teachers treat students 
differently depending upon their expectations for the students' per- 
formance (e.g., Brophy & Good, 1970; Good & Brophy, 1972; Rothbart, 
Da'.fcr, & Barrett, 1971). Moreover, these expectancies, and presumably 
their behavioral manifestations, have been shown to relate to students' 
academic achievement (Brophy & Good, 1970). Part of the research described 
below was aimed at examining both teacher-bias and teacher-expectancy 
effects in order to assess their relationship to children's acadGWic 
achievement. 

A perusal of the above literature review also leads to several rather 
interesting conclusions regarding deficiencies in the research dealing 
with experimenter-bias effects. First, there is not one study which 
has used more than one age level of 5^. In fact, in all but two of the 
studies cited above the Ss were college students. It 1s the opinion of 
the writer that by studying the developmental aspects of aduU-expectancy 
effects three goals will be accomplished: a) valuable information on the 
generality of the i-bias effect across age of S^ will be added to the 
existing literature; b) the importance of ^-expectancy effects in experi- 
mental child psychology will be explored. In view of the literature 
dealing with teacher expectancy effects it is necessary that rigorous 
^experimental investigations with children be undertaken to delineate the 
conditions and factors related to adult-expectancy effects with children; 
c) developmental studies of expectancy effects will explicate some of 
the factors related to how an individual develops the skills necessary 
for interpreting subtle cues from another individual in a face-to-face 
situation. Second, studies of the effects of mode of inducing bias 
in Es are lacking, with the presently available research being inconclu- 
sive. Part of the research described below was aimed at each of these 
problems. 
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Experiment I 

A Longitudinal Study of Teacher-bias and Teacher-expectancy Effects 
on Elementary School Children's Achievement Test Performance 

The purpose of this axperiment was to investigate teacher-bias and 
teacher-expectancy effects on elementary school children's achievement 
test performance. Teacher-bias is defined as above, that is, an expec- 
tancy for performance as induced by the principal investigator, Anala- 
qously, teacher expectancy is defined as the teacher's own self-generated 
(stated) expectations regarding children's performance. In this experi- 
ment, as well be noted below, teacher-bias was manipulated by statements 
from the principal investigator and teacher-expectancies were measured 
by teachers' renitings rev^arding year-end academic perfo>Tnance levels. 

Subjects . 

The subjects were 32 second-qraders (C^ = 8.60 years), 13 boys and 
19 girls, and 32 fourth-graders (CA" = 10.73 years), 15 bo>s and 17 girls, 
attending a school serving primarily a lower-class population. There 
were 16 subjects in each of two classrooms In each grade level. 

Procedure . 

During the first week of the 1971-1972 academic year several sub- 
tests from the SAT battery were administered by the principal investi- 
gator to each of the classrooms involved in the study. The subtests 
administered includea: Word Reading, Paragraph Meaning, Spelling, 
Arithmetic Computation, and Arithmetic Concepts (fourth-grade only). 
The Primary I and Partial Intermediate I batteries were used for the 
second- and fourth-grades, respectively. The SAT's were disguised as 
tests to measure potential gains in language and arithmetic skills. The 
same subtests were administered at the middle and end of the 1971-1972 
academic year. The SAT's were again administered at the beginning and 
middle of the 1972-1973 academic year, the children now being in the third- 
and fifth-grades. Subtests from the Primary II and Partial Intermediate I 
were now employed for the third- and fifth-graders respectively. It is 
important to keep in mind that the subjects were, at this time, in new 
grade levels with new teachers. 

During the initial testing session each teacher was asked to 
rank the children in her classroom from l-ji based on her expectations 
regarding their year-end performance levels in language and arithmetic 
skills. In each classroom the children ranked 1-16 were randomly and 
equally divided into an experimental and a control group. One week 
after the initial testing each teacher was given the names of the children 
in the experimental group and was told that, on the basis of the tests, these 
children should show large gains in language and arithmetic skills during 
the academic year. It should be noted that no further mention of these 
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children was made to apx teacher throughout the remainder of the study, 
a year and a half. 

Results . 

The dependent variables were total SAT raw scores for each testing 
session. Originally, the design was conceived as a three-way factorial 
arrangement, including experimental vs. control groups, grade level, 
and teacher ranking. However, due to subject attrition there v#ere not 
an equal number of subjects in each cell. * Rather than solve the analysis 
problem by application of the unweighted means solution to the analysis 
of variance the multiple regression approach of Cohen (1968), Overall and 
Spiegel (1969), and Overall (1972) was employed. 

The results of the multiple regression analyses are summarized in 
Table 1. The means associated with the main effects of the multiple 
regression analyses are presented in Table 2. As may be seen in Table 1 
the bias manipulation (Experimental Condition) was not significantly 
related to SAt performance on any of the five testing occasions. Grade 
Level was significantly related to performance on SAT-2, SAT-3, and SAT-4. 
As may be seen in Table 2, the younger Ss scored higher than the older 
Ss on SAT-2 and SAT-3 with the reverse Feing the case for SAT-4. 

Teacher ranking was strongly and consistently related to SAT per- 
formance on each testing occasion. In general, the higher the 
teacher's ranking the higher the child's SAT performance (see Table 2). 
The correlations between SAT performance and Teacher Ranking, presented 
in Table 3, reflect the strength of the relationships detected in the 
multiple regression analyses. 

Conclusions . 

The findings are quite conclusive with respect to the importance of 
teacher-bias and teacher-expectancy effects on children's academic 
performance. Clearly, simplv telling teachers certain students would be 
performing well at the end of the academic year was not sufficient to increase 
there students' SAT performance. It appears that the teachers biased neither 
the SAT performance nor the classroom learning of the children in the exper- 
imental or control groups. This appears to be the case for both short- 
and long-term effects due to teacher-bias. This finding does not replicate 
the findings of Rosenthal and Jacobson (1968). When considered in conjunction 
with other research (e.g., Claiborn, 1969; Anderson & Rosenthal, 1969; Evans 
& Rosenthal, 1969; Fleming & Anttonen, 1971; Jose & Cody, 1971) which has also 
failed to replicate the Rosenthal and Jacobson findings, however, it seems 
quite clear that teachers do not bias students' performance. 

Teacher Ranking was related to SAT performance on each testing occasion. 
Children ranked higher by the teacher had higher SAT scores than children 



Analyses of SAT-1 scores revealed no differences between the children remaining 
available at the end of the first year or middle of the second year and those 
lost throughout the experiment. 
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Table 2 
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Mean Stanford Achievement Test Scores for 
Each Condition at Each Testing Session 



Condition 



Condition 
Experimental 
Control 



SAT-1 



58.35 
56.63 



Mean Test Score 
SAT-2 SAT-3 



79.15 
76.51 



96.30 
96.71 



SAT-4 



75.53 
68.58 



SAT- 5 



84.21 
81.11 



Grade Level 
Second (Third) 
Fourth (Fifth) 



57.59 
57.38 



85.63 
70.37 



111.38 
79.95 



65.09 
81.63 



84.91 
79.56 



Teacher ranking 
Rankings 1-4 
Rankings 5'-8 
Rankings 9-12 
Rankings 13-16 



75.56 
60.13 
52.50 
41.75 



96.55 
81.48 
73.21 
60.06 



116.13 
100.82 
89.08 
80.00 



101.75 
67.13 
76.36 
49.73 



116.50 
86.50 
77.82 
60.09 



^ The grade level listed in the parentheses refers to SAT-4 and SAT-5. 

^ Teacher ranking was entered as a continuous variable in the multiple 
regression analyses. The data are grouped here simply for convenience. 
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Table 3 



Correlations Between Teacher Ranking and SAT Performance 
Across All Grade Levels and Conditions 



SAT 



1 -.59 

2 -.67 

3 -.55 

4 -.55 

5 -.67 



n = 51 for SAT-1, SAT-2. and SAT-3 and n = 38 for SAT-4 and 
SAT-5. All r's are statistically significant (p <.001). 



ranked lower. This effect has been deemed a teacher expectancy 
effect since it reflects the teacher's own self-generated expec- 
tancy for the child's performance. 

There is some evidence in the present study which supports the 
argument that this teacher-expectancy effect ib not a teacher-bias 
effect in the Rosenthal and Jacobson (1968) sense. The ^^i^^J P^" 
of evidence is the correlation between Teacher Ranking and SAT-1 
performance. If this teacher-expectancy effect were due to teachers 
somehow biasing the test performance of the children it is unlikely that 
the magnitude of the correlation would have been as large. Second, 
teacher ranking was related to SAT performance 12 and 18 months after 
the ranking was made, the students now being advanced one grade level 
and under the tutelage of a new teacher. It is unliely that this could 
be the case were the relationship based on a biasing influence by the 
teacher of the students' performance. Finally, the teachers reported that 
their rankings were basdd on criteria directly relevant to academic 
abilities, e.g.. previous grades, readiness tests, and current classroom 
performance. 

These effects due to teacher-expectancy appear to reflect the 
teacher's ability to accurately estimate the relative academic ability 
of the children in her classroom. The longitudinal data presented 
above appear to support this contention. Future research should 
focus on determining the exact bases used by teachers to form expect- 
ancies regarding students' abilities and the relationship of these 
bases to actual student performance as well as to teacher-student 
interaction in the classroom. Such research will not only clarify 
the nature of teacher-expectancies but also the role of the teacher 
in the child's cognitive and social development. 
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Experiment II 

A Developmental Study of Experimenter Bias 
Effects with Children as Subjects 

As was noted in the above literature review the research on 
experimenter-bias (E-bias) effects has been deficient in examining 
a number of areas, especially in experimental child psychology 
research. Although E-bias effects have been shown in studies using 
children as Ss (e.g" Dusek, 1971, 1972), no information regarding 
developmentaT trends in susceptibility to E-bias effects 1^ available. 
The major purpose of this experiment was to test for possible developmental 
trends. A secondary purpose was to include both male and female Es 
in order to assess the possibility that boys and girls are differentially 
susceptible to subtle influences from men and women. 

Subjects . 

The subjects were 48 first (C]^ = 7 yrs. 4 mo., SD = 9 mo.), 48 third- 
(CA = 9 yrs. 5 mo. , SD = 7 mo.), and 48 fifth-graders (CA = 11 Y^s. 6 mo., 
SD = 8 mo.). Half the children in each grade level wcrp males and half 
were females. The children attended a school serving primarily a lower-class 
neighborhood. 

Experimenters . 

The experimenters were six male and six female college students 
(CA = 19 yrs 11 mo., SD = 7 mo.) enrolled in the introductory psychology 
course at Syracuse University. Each E participated in both a group and an 
individual training session prior to testing the children (see below). During 
the experiment each E tested two boys and two girls from each g'^J**^' ^vel . 
Half the experimenters of each sex were randomly assigned to each bias condition. 

Apparatus . 

The apparatus has been described in detail elsewhere (Stevenson & Fahel , 
1961). Briefly, it consisted of a table with two bins and a transverse 
upright panel which served as a shield. The left bin contained approx mately 
innn ma^Kipc; thp table top above the right bin contained five randomly 
placed holes through which the marbles could be dropped. An Esterline Angus 
Event Recorder, shielded from S's view, was connected to microswitches below 
the holes and was used to obtain an automatic and permanent record of 5 s 
responses. The experiment was conducted in an area of the school free of 
distractions. 

Procedure . 

Experimenter training . Each experimenter was randomly assigned. to one 
of the two bias conditions. All experimenters (n=6) in the same bias condition 
attended the same group training session. The experimenters were shown the appara- 
tus and the procedure was briefly outlined and demonstrated. The experimenters 
were then told they would be testing children in the public schools and the 
following biasing statement was made: 

We have used this task with thi: age children before and it 

has been found to be a sensitive measure of children's motivation. 

In fact, previous research shows that one of the findings we 
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should expect to get is that boys (girls) will 
drop the marbles faster than girls (boys). 

The procedures were then demonstrated again and each experimenter 
practiced the task. Each experimenter subsequently met with a graduate 
assistant to further practice the procedures. 

Exper{;.iental task . The experimenter brought the subject to the 
testing room and reaH^he instructions telling the subject to pick 
the marbles up one at a time and put their into the holes. As the 
subject picked up the first marble the experimenter started a stop 
watch and allowed the child to perform at the task for seven minutes. 

During the first or baseline minute of the task the experimenter 
remained an attentive but nonresponsive observer of the subject's performance, by 
glancing at the marbles and holes while avoiding looking at the subject. 
During the next six minutes* the experimental period, the experimenter 
used verbal reinforcers on a Fixed Interval 30-second schedule contingent on 
a marble drop. Six reinforcing statements were used: Good, Fine, That's good. 
That's fine. Very good. Very fine. Each subject received each statement 
twice in a predetermined random order. Each experimenter tested two boys 
and two girls at each grade level using this procedure. 

Design . 

The experimental procedures required a 3 (Grade Level) x 2 (Sex of 
E) x 2 (Bias Conditions) x 2 (Sex of S) x 7 (Minutes) analysis of 
variance design with six subjects in The smallest cell. 

Results . 

Dependent measures . There were two dependent variables of interest 
in the study: the base rate of response ( the number of marbles dropped 
in the first minute of the task) and a series of difference scores com- 
puted separately for each subject by subtracting the number of marbles 
dropped in the first minute from the number of marbles dropped in each 
subsequent minute (Minutes 2-7). The correlation between the base-rate 
score and the average difference score was -.4135 (n. - 144, p <.01^ 
Indicating that although the two variables are correlated only 17.1% of 
the variance in the difference scores is accounted for by the initial base 
rates. 

Analysis of base-rate scores . The base-rate scores were subjected to 
a 3 (Grade LeviT) x 2 (Sex of Experimenter) x 2 (Bias Conditions) x 2 
(Sex of Subject) analysis of variance (see Table 4). The mean base-rate 
scores for each main effect are presented in Table 5. 
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Table ;• 

Analysis of Variance of Base-Rate Scores 



Source ^f MS 



Grade Level (A) 2 604.000 23.16 <.0Q1 

Sex of E (B) 1 4.000 <1 

Bias Condition (C) 1 103.313 3.96 

Sex of S (D) 1 :«8 < 

A X F 2 14.65G <1 

A X C 2 56.188 2.16 

A X D 2 14.094 <1 

BxC 1 1.813 

B X D 1 8.000 <1 

C X D 1 277.813 10.65 <.0Ol 

Ax BxC 2 31.375 1.20 

A X B X D 2 3.063 <1 

A X C X D 2 68.375 2.62 

B X C X D 1 84.000 3.22 

AxBxCxD 2 21.750 <1 

error 120 26.078 
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Table 5 

Mean Base-Rate and Mean Difference Score 
for Each Main Effect 



Effect 



Mean 
Base-Rate 



Mean 

Difference Score 



Grade Level 
First 
Third 
Fifth 



20.69 
23.46 
27.73 



-.44 
1.59 
1.59 



Sex of £ 
Male 
Female 



23.79 
24.12 



.95 
.88 



Bias Condition 
To Males 
To Females 



23.11 
24.81 



1.06 
.76 



Sex of S^ 
Male 
Female 



24.01 
23.90 



.37 
1.46 
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As may be seen in Table 4, there were two significant effects. 
The significant Grade Level effect reflected a general increase in 
base rates with increasing grade levels. Newman-Keuls comparisons 
(Winer, 1962, p. 80) revealed that the means for each grade level 
were significantly different from each other (all p <.01). The 
Bias Condition x Sex of Subject interaction was also significant. 
The means are presented in Table 6. Individual comparisons (Winer, 
1962, p. 207ff) revealed a significant Bias Condition effect only 
for the boys (F = 13.79, df = 1/120, p <.001), but significant 
Sex of Subject effects for both Bias to Males (F = 4.88, df = 1/120, 
p <.05) and Bias to Females (F = 5.76, df = 1/120, p <.05). 

Analysis of Difference Scores . The difference scores were subjected 
to a 3 (3rade Level) x 2 (Sex of Experimenter) x 2 (Bias Condition) 
X 2 (Sex of Subject) x 6 (Minutes) analysis of variance (See Table 7). 
The means V'or the main effects are presented in Table 5. The significant 
Grade Level effect reflected higher difference scores for the third- 
and rifth-graders than for the first-graders (See Table 5). Female 
S^s had higher difference scores than male Ss (See Table 5). 

The Grade Level x Sex of Subject and Bias CondUion x Sex of Sub- 
ject interactions were also significant. The means for each effect 
are presented in Tables 8 and 9, respectively. Tests of simple effects 
(Kirk, 1968, p. 289ff) on the means in Table 8 revealed a significant 
Sex of Subject effect only for the fifth-graders (F = 12.20, df = 1/120, 
p <.001). The Grade-Level effect was significant for both the boys 
(F = 3.96, df « 2/120, p <.05) and the girls (F = 11.90, df = 2/120, 
p <.01). Individual comparisons (Winer, 1962, p. 344) conducted on 
the means in Table 9 revealed significant Bias Condition effects for 
both the boys (F = 5.52, df = 1/120, p <.05) and the girls (F = 29.20, 
df = 1/120, p <.001) and significant Sex of Subject effects for Bias 
to Males (F = 18.54, df = 1/120, p <.001) and Bias to Females (F = 11.88, 
df = 1/120, p <.001). 

These interactions are of limited interest in view of the 
significant Grade Level x Bias Condition x Sex of Subject interaction 
(See Table 10). Individual comparisons (Winer, 1962, p. 344) were con- 
ducted on the bias condition x sex of subject means separately for 
each grade level and tests of simple effects (Kirk, 1968, p. 289ff) were 
conducted on the grade level x sex of subject means for each bias con- 
dition. The Individual comparisons revealed no significant Bias Con- 
dition or Sex of Subject effects at the first-grade level. At th2 third- 
grade level there was a significant Bias Condition effect (F = 12.20, 
df = 1/120, p <.001) for the males but not for the females. There were 
significant sex differences for both the Bias to Male (F = 7.36, df = 
1/120, p <.01) and Bias to Female (F = 4.22, df = 1/120, p <.05) con- 
ditions. At the fifth-grade level the bias conditions were significantly 
different for both the male (F = 7.72, df = 1/120, p <.01) and female 
Ss (F = 16.11 , df = 1/120, p <.O01). The tests of simple effects revealed 
that for the Bias toward Males condition there was a significant age effect 
(F = 10.43, df = 2/120, p <.0Ol) for the male Ss but not for the female 
Ss. In the Bias toward Females condition the age effect was significant 



Table 6 

Mean Base- Rate Scores for the Bias Condition 
X Sex of Subject Interaction 



Sex of Bias Condition 

Subject Males Females 

Males 21.78 26.25 

Females ::*.44 23.36 
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Table 7 



Analysis of Variance of Difference Scores 



Source 


df 


MS 


F 


P 


Grade Level (A) 


2 


394.066 


7.22 


<.001 


:ex of E .(B) 


1 


1 .260 


<1 




Bias Condition (C) 


1 


19.2B0 


<1 




Sex of S (D) 


1 


254.584 


4.67 


<.05 


A X B 


2 


19.448 


<1 




A X C 


2 


81.816 


1.50 




A X D 


2 


225.876 


4.14 


<.05 


B X C 


1 


13.751 


<1 




B X D 


1 


10.446 


<1 




C X D 


1 


1641 .760 


30.10 


<.001 


A X B X C 


2 


6.689 


<1 




A X B X D 


2 


78 . 300 


1 .43 




A X C X D 


2 


166.774 


3.06 


<.06 


B X C X D 
^# 


1 


19.862 


<1 




A X B X C X D 


2 


4.689 


<1 




error 


120 


54.546 






Minutes (E) 


5 


.874 


<1 




A X E 


10 


12.205 


2.48 


<.01 


B X E 


5 


8.841 


1 .80 




C X E 


5 


3.074 


<1 




D X E 


5 


3.493 


<1 




A X B X E 


10 


6.853 


1 .40 




A X C X E 


10 


6.297 


1 .28 




A X D X E 


10 


5.651 


1.94 




B X C X E 


5 


4.660 


<1 




B X D X E 


5 


2.659 


<1 




C X D X E 


5 


11.008 


2.24 


<.06 


A X B X C X E 


10 


3.480 


<1 




A X B X D X E 


10 


4.914 


1.00 




A X C X D.x E 


10 


3.996 


<1 




B X C X D X E 


5 


8.826 


1.80 




AxBxCxDxE 


10 


1.891 


<1 




error 


600 


4.911 
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Table 8 

Mean Difference Score for the Grade Level 
X Sex of Subject Effect 

Sex of Subject Grade Level 

First Third Fifth 



Males -.75 1.79 .07 

Females -.12 1.38 3.11 



Table 9 

Mean Difference Score for th^ bias Condition 
X Sex of Subject Effect 



Sex of subject Bias Condition 

Males Femeles 



Males 1.90 -1.16 

Females .23 2.F8 
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Table 10 

Mean Difference Scores for the Grade Level 
X Bias Condition x Sex of Subject Interaction 



Bias Condition 


Sex of 




Grade Level 






Subject 


First 


Third 


Fifth 


Males 


Male 


-.03 


3.94 


1.78 




Female 


T.56 


.60 


.64 


Females 


Male 


-1.47 


r:36 


-1.64 




Female 


.31 


2.17 


5.58 
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Table 11 

Mean Difference Scores for the Grade 
Level X Minutes Interaction 



Grade Level 



First 
Third 
Fifth 



Minutes 
4 5 



.12 -.21 -.75 -.46 
1.02 1.79 1.33 1.83 
1.62 1.27 1.71 1.56 



-.19 -1.15 
1.88 1.67 
1.02 2.35 
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for both the males (F = 6.45, df = 2/120, p .01) and the females 
1?^ 18 86, df = 2/120, p .001). In effect, both the individual 
comparisons and the tests of simple effects rayeal f^d^ level 
differences and sex differences in susceptibility to subtle cues 
emitted by the experimenters. 

There were two within subjects comparisons which were also fou- 
nd ta be significant. The means for the Grade Level x Minutes Inter- 
Sct^n may be^seen ^ , Table 11. In general, the performance rates of 
the first-graders declined over time but the perf6rmance rates of the 
third- end fifth graders increased over time and then remained relatively 
stablt. 

The Bias Condition x Sex of Subject x Minutes interaction approached 
the traditional D^.OS level of significance. The performance curves 
refle tei brVhis effect are shown'in Figure 1 ^"^^--^'^^^^/e^P^jl^?"/ 
(Winer 1962) of each pair of means for each minute revealed no signif- 
clnrdifferen^es between the sexes in the Bias to Males condition I ndi - 
vidSa-r comparisons for the Bias to Females condition, however, revealed 
significant sex differences at each minute. 

Discussion . 

There were two major objectives in the P'^^^J^.J^^Jy ^.'l^JJif ^IJes" 
the possibility of developmenta trends n susceptbilty to subtle cu^^ 
emitted by experimenters, and b to examine possible ^^^^^'^ential bias 
^fects due to male and female experimenters. The data provided an 
afftrmative answer to the fonner objective Ji^d a negative anrwer to he 
latter objective. Data with respect tt each objective will be discussed 
in turn. 

Generallv speaking, the experimenters did bias the performance of 
the children The significant Bias Condition x Sex of Subject interaction 
4n tho Inriviis of the difference scores reyeals differences in the predicted 
iire tio , 1 e , o'v pe fS^ than girls for experimenters 

b aled ?Sward m^les and girls performed at a higher rate than boys for 
experimenters biased toward girls. Although this same nteract o was si 
icant in the analysis of base-rate scores, suggesting ^^e possibility that 
the difference score analysis is contaminated by regression effects (Parton 
Tros .™ Stevenson & Hill, 1966), there is some evidence to suggest this 
is not the case. First, the base-rate scores for the females did not uiffer 

iltween the Bi.s Conditions but the difference ^"^^^..^itfican'tlv di? erent 
base-rate scores for the three age groups were each ^ignij ^ntly different 
from the other but in the difference score analysis the Jhird- and fifth 
nrades had identical and positive difference scores and the first-graae naa 
f ne t e Jiffe^enie sco're. Although part 07%) of the ?^"J"J ^.Jf " 
scores can be attributed to initial base-rate level because o^/he Base-Rate 
X Difference Score correlation (r = .41) the above data suggest this is not 
an over-riding factor. 

The Bias Condition x Sex of Subject x Minutes interaction (^--gure 1) 
revea ed no significant Sex of Subject differences for e^P^^if^ters biased 
Jowfrd .ales, although the means were in the predicted direction, but cons stent 
Sex of Subiect differences for experimenters biased toward fema es. This is a 
replication of a finding reported by Dusek (1972) in which female experimenters 
biased toward boys did not significantly bias the performance of boys or 
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girls. In the present study neither male nor female experimenters biased 
toward boys could erfect significant changes over time. The basis of this 
finding is unclear. 

The major finding with respect to the predictions was the Grade Level x 
Bias Condition x Sex of Subject interaction (Table 10) which revealed clear- 
developmental trends in susceptibility to experimenter bias effects. At the 
first-grade level there were no significant Bias Condition effects for either 
the male or female Ss, although'the means were in the predicted directions. 
At the third-grade level the Bias Condition effect was significant for the 
males, the mean difference score was higher if the experimenter was biased 
toward males than females; for the females the Bias Condition effect was not 
significant although the means were in the predicted direction. At the fifth- 
grade level the Bias Condition effect was significant, for both the male and 
female subjects with the means in the predicted direction. 

The above findings indicate clear developmental and sex of subject 
trends in susceptibility to experimenter-bias effects. Although the exact 
bases of these trends is difficult to elaborate at the present time it 
may be that as children become more developmental ly mature they are better 
able to interpret the subtle cues emitted by the experimenter and tend to 
comply with the interpretation placed on the cues. The processes involved 
may be similar to those examined by Flavell (1968) in connection with 
children's role-taking and communication skills. 

Sex of experimenter was neither a significant main effect nor involved 
in a significant interaction in the analysis of either the base-rate or 
difference scores. Thus. the bias effects obtained were independent of 
sex of experimenter and support the earlier findings of Dusek (1971, 1972} 
with respect to experimenter bias by male and female experimenters testing 
children. 

Future research aimed at testing a) the reliability of the above 
findings and b) the basis of the observed grade level differences is 
obviously called for. The results of such a series of experiments should 
aid the development of a theory of experimenter bias effects. Such research 
should also lead to a better understanding of childhood socialization vis-a-vis 
interaction with adults. 
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Experiment III 

Adult Expectancy Effects: Self -genera ted 
versus Induced Expectancies 

As noted in the literature review, when evaluating adult 
expectancy effects there Is but little evidence relating to the 
importance of the manner by which the adult acquires the expec- 
tancy for the to-be-produced outcome. Some of the available 
evidence (e.g.. Bootzin. 1971) suggests that self -generated ex- 
pectancies relate more to obtained bias than expectancies induced 
by the principal investigator. However, there is other evidence 
(e.g.. Marcia. 1961; Marwit & Marcia. 1967) which does not support 
this position. As a result, the issue remains unresolved. Exper- 
iment III was aimed at assessing the importance of mode of devel- 
opment of adult expectancies vis-a-vis the effectiveness of adults 
to bias the simple motor performance of children. 



Subjects . 

The subjects were 48 kindergarten children (CK = S yrs. ^ "lo., 
SO = 8 mo.), half males and half females. The children attended a 
school serving primarily a lower-class neighborhood. 



Experimenters . 

The experimenters were 12 male college students (Ci^ = 19 yrs. 9 mo.. 
SO = 13 mo.) enrolled in the introductory psychology course at Syracuse 
University. During the experiment each E tested two boys 3"°^ ^^o girls. 
Each E was randomly assigned to one bias condition and one Induction condition. 



Apparatus . 

The apparatus was identical to that used in Experiment II. 



Procedure. 



Exp erimenter training . With the exception of the group training 
session the experimenter training was essentially identica to that 
of Experiment II. Each experimenter was trained individually, tx- 
oerimenters assigned to the Induced Bias Condition were given the 
same statement as was given in Experiment II. Experimenters in the 
Self -generated Bias Condition were asked to predict whether boys or 
girls would drop the marbles faster. Each experimenter practiced the 
task administration procedures with a graduate student. 



the experimental task were 

identii - . . - - - 



Experimental task . The procedures for i 
tical to those employed in Experiment II 



Design . 

The design of Experiment III was a 2 (Induction Condition) x 
2 (Bias Condition) x 2 (Sex of Subject) x 7 (Minutes) factorial design 
with six subjects in the smallest cell. 
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Results 

Dependent measures . As in Experiment II, the dependent measures were 
the base-rate of response and six difference scores, one for each minute of 
the experimental period of the task. The correlation between the base- 
rate and the average difference score was -.62 (N = 48, p <.01), indicating 
that approximately 38% of the variance in the difference scores is accounted 
for by the initial base-rate levels. 

Analysis of base-rate scores . The base-rate scores were subjected to 
a 2 TranEtTor"Condition) rTTIias Condition) x 2 (Sex of Subject) analysis 
of variance (see Table 12). The mean base-rate scores for each main effect 
are presented in Table 13. As may be seen in Table 12 the only significant 
effect was the triple interaction involving Induction Condition, Bias Cond- 
ition, and Sex of Subject. The means for this effect are presented in 
Table 14 Individual comparisons revealed the following: a) the only signit- 
icant sex difference (F = 7.08, df = 1/40, .0- <.05) was in the Self -Generated 
Bias to Females Condition; b) the only significant Bias Condition difference 
was for the male Ss in the Self -Generated Induction Condition. 

Analysis of Difference Scores . The difference scores were subjected to 
a 2 (TndUctionTondition) x TW&s Condition) x 2 (Sex of Subject) x 6 
(Minutes) analysis of variance with repeated measures on the last factor 
(See Table 15). The means for the between-subjects main effects are 
presented in Table 13. The only significant between-subjects effect was 
the Bias Condition x Sex of Subject interaction (see Table 16). Individual 
comparisons (Winer, 1962, p. 344) revealed significant Sex of Subject effects 
for Bias to Males (F = 6.48, df = 1/40, p <.05) and Bias to Females (F - 22.90, 
df = 1/40, p <.001) and significant Bias Condition effects for Male Ss 
(F = 10.36, df = 1/40, p -.01) and Female Ss (F = 16.92, df = 1/40, p <.0Ol). 

There were several significant within-subjects effects. The significant 
Minutes main effect (See Table 17) reflected a general decrease n «^ate of 
response during the experimental period of the task The Induction Condition 
X Minutes interaction was significant (see Table 17) and reflected a general 
decrease in rate of response for Ss in the Induced condition and, orally, 
an Increase and then decrease in rate of response for Ss in the Self-Generated 
condition. The Induction condition x Bias Condition x Minutes Interaction 
was also significant (see Figure 2). Individual comparisons revealed that for 
the Induced Condi don there were significant Bias Condition effects for 
minutes 4 (F = 6.50, df = 1/200, p <.05 , 5 F = 6.03, df = 1/200, P <.05), 
6 (F = 6 36, df = 1/200, p <.05), and 7 (F « 6.56, df = 1/200. p< .05) , but 
for the Self -Genera ted Condition the only significant Bias Condition effect 
was for minute 4 (F = 4.27, df = 1/200, p <.05). 



Discussion. 

The major focus of Experiment III was to investigate the effects of 
mode of inducing expectations In the experimenter in the obtaining of 
experimenter-bias effects with children. Although the analysis of the 
difference scores revealed a significant Bias Condition x Sex of Subject 
interaction, indicating a significant experimenter-bias effect, this effect 
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Table 12 

Analysis of Variance of Base-Rate Scores 



Source 

Induction Condition (A) 
Bias Condition (B) 
Sex of Subject (C) 

A X B 

A X C 

B X C 

A X B X C 

error 



df 


MS 


F 




17.516 


<1 




1.688 


<1 




17.520 


<1 




46.023 


2.44 




6.023 


<1 




50.020 


2.66 




88.023 


4.67 


40 


18.838 
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Table 13 

Mean Base-Rate and Mean Difference Score 
for Each Main Effect 



Effect Mean Mean 

Base-Rate Difference Score 



Induction Condition 

Induced 17.92 -.75 

Self-generated 16.71 .56 

Bias Condition 

To Males 17.50 .J3 

To Females 17.12 .88 

Sex of Subject 

Males 17.92 .10 

Females 16.71 1.20 



Table 14 

Mean Base-Rates for the Induction Condition x Bias Condition 

X Sex of Subject Interaction 

Induction Bias Sex of Subject 
Condition Condition Male Female 

Ir.duced To Males 19.67 18.50 

To Females 16.67 16.83 



Self-Generated To Males 

To Females 



14.50 17.33 
20.83 14.17 
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Table 15 

Analysis of Variance of Difference Scores 



Source df MS F £ 



Induction Condition (A) 


1 


2.722 


<1 




Bias Condition (B) 


1 


14.222 


<1 




Sex of Subject (C) 


1 


86.681 


2.44 




A X B 




102.722 


2.90 




A X C 




8.681 


<1 




B X C 




946.125 


26.675 


<.001 


A X B X C 




5.014 


<1 




error 


40 


35.469 






Minutes (D) 


5 


28.431 


6.60 


<.001 


A X D 


5 


10.631 


2.47 


<.05 


B X 0 


5 


.631 


<1 




C X D 


5 


6.255 


1.45 




A X B X D 


5 


9.797 


2.28 


<.06 


A X C X D 


5 


.556 


<1 




B X C X D 


5 


3.867 


<1 




A X B X C X D 


5 


4.256 


<1 





error 200 4.306 
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Table 16 



Mean Difference Scores for Each Bias 
Condition x Sex of Subject Subgroup 



Bias Condition Sex of Subject 

Male Female 



To Males 1.69 -.83 

To Females -1.49 3.24 



Table 17 



Mean Difference Scores for the Minutes x 
Induction Condition Interaction 



Minutes 

Induction 2 3 4 5 6 7 

Condi tion 



Induced 


2.04 


1.42 


.25 


.79 


.17 


-.17 


Self-generated 


.88 


.92 


1.71 


.79 


.17 


-1.12 


Mean 


1.46 


1.17 


.98 




.17 


-.64 
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did not interact with Induction Condition. Induction Condition did 
interact with Minutes, and with Bias Condition and Minutes. However, 
these effects are not readily interpretable given current theorizing 
in the area. The data would appear to support the findings and 
theorizing of Marcia (1961) and Marwit and Marcia (1967), indicating 
no significant differences due to Induction Condition. 

It must be pointed out that both the correlation between the 
base-rate and the average difference score, as well as the significant 
Induction Condition x Bias Condition x Sex of Subject interaction 
in the analysis of the base-rate scores, lead to the possibility of 
relatively strong regression effects in the analyses of the difference 
scores. Although it is not readily possible to accurately evaluate 
the extent of this possible regression effect due to initial base 
rate, it may be that the experimenter-bias effect occurs primarily in 
the initial interactions in the experiment. That is, the experimenter- 
bias effect may be reflected in the base-rate scores as opposed to 
the difference scores. The current investigation was not designed in 
such a way as to allow the veridical ity of this possibility to be 
examined. Data reported by Rosenthal (1966) and Friedman (1967), 
however, lend credence to this view. 



-35- 



References 

Anderson. D. F.. & Rosenthal R Sc.e effects^f interper^ 

V a ^iivpr M Fact, fiction and the experimenter bias effect. 

Y a Silver M Pitfalls in data analysis and interpretation: 
'''''a Veply to Rosenthal ^ pivcholoqical Buljetin Monoflra^h Suppl^ent , 

1968, 70, 48-62. (b) 

1963. Pp. 605-606. 
Bootzin R induced and stated expectancy in f P^^ijenter bias Proceed- 
ings of the 77th Annual Convention , APA^ 1969, 4(1 ) , 365-6. 

, r 0 r««^ T I Teachers' communication of differential 
TZ%l^gl UijCit^ 1970. 61. 365-374. 

New York, September, 1966. 
Indiana University, 1969. 

n n r Perceived reward value of teacher reinforcement and attitude 

■''"nX^r^each^r'tnlppTcation Of N^co^^ balance theory. Journal of 
Educational Psychology , 1972, 63, 418-4^^. 

, . r rhiiH*-on'<; nerceotion of their teachers' feelings 

tS^;irf!;crpJr%oo| achievement, and behavior. 

■Tnnrn^nf Experimental Education , 1960, 29, lOZ-nu. 

Dusek J B. Experimenter bias in performance of children at a simple motor 
°"^ t..i; Developmental Ps^^cholosx, 1971 , 4, 55-62. 



-36- 



Dusek. J. B. Experimenter-bias effects on the simple motor task 
performance of low- and high-test anxious boys and girls. 
Psychological Reports , 1972, 30, 107-114. 

Dusek, J. B., & O'Connell, E. J. Teacher expectancy f^^ects on the 
achievement test performance of elementary school children. 
Journal of Educational Psychology , 1973, in press. 

Elashoff, J. D., & Snow, R. E. A case study in statistical inference: 

Reconsideration of the Rosenthal -Jacobson data on teacher expectancy. 
Technical report #15, Stanford Center for Research and Development 
in Teaching, School of Education, Stanford University, December, 1970. 

Elashoff, J. D., & Snow, R. E. (Eds.) Pygmalion reconsidered. Worthington, 
Ohio: Charles A. Jones, 1971. 

Evans, J., & Rosenthal, R. Interpersonal self-fulfilling prophecies: 
Further extrapolations from the laboratory to the classroom. 
Proceedings of the APA Convention , 1969, 371-372. 

Flanagan, J. C. Test of general ability: technical report. Chicago: 
Science Research Associates, 1960. 

Flavell, J. The development of role-taking a_nd commu ni ca ti on skills 
in children . New York. John Wiley & Sons, Inc., 1968. 

Fleming, E. S., & Anttonen, R. G. Teacher expectancy as related to the 
academic and personal growth of primary-age • -^°g°f ^P^ - ^ -,§f 

the Society £21 ^^^^^T^L Development , 1971 , 36 (50^, Serial 

No. 145. 

Flowers, D. E. Effects of an arbitrary accelerated group placement on 

the tested academic achievement of educationally disadvantaged students 
Unpublished doctoral dissertation. Teachers College, Columbia University, 
1966. 

Friedman, N. Th. nature of Psychological Research: Jhe psychological 

Nw York: Basic SB^s, 1967. 

Good, T., & Brophy, J. Teacher-child dyadic interactions: A.JJ^w "lethod of 
classroom observation. Journal of School Psychology , 1970, 8. 131-138. 

Good. T., & Brophy, J. E. Behaviora^. expression of teacher attitudes. 
Journal 0^ E ducational Psycholog y, 1972, 63, 617-624. 

Jensen, A. R. How much can we boost IQ and scholastic achievement? Harvard 
Educational Review , 1969, 39, 1-123. 



Johnson, R. Inducement of expectancy and set of subjects as determinants 
of subjects' responses in experimenter expectancy research. Unpub- 
lished doctoral dissertation, University of Manitoba, 1970. 

Johnson, R., & Adair, J. The effects of systematic "^ecor^i^g ^r;: ^r vs^ 
experimenter bias on latency of word association. Unpublished manu- 
script. University of Manitoba, 1970. 

Jose', J., & Cody, -acher-pupil interaction as it relates to attempted 
changes in teacher ^-^pectancy of academic ability and achievement. 
American Educaticna ' ^ ^ search Journal , 1971 , 8, 39-49. 

Kirk, R. Experimental desig.n: Procedures for the behavioral sciences.. 
Belmont, Califi: CoTe Publishing Co., 1968. 

Marcia, J. E. The need for social approval, the condition of hypothesis^ 

making, and their effects on unconscious experimenter bias. Unpublishea 
master's thesis, Ohio. State University, 1961. 

Marwit, S. J., & Marica, J. E. Tester bias and response to projective 
instruments. Journal of Con.jlting Psychology , 1967, 31., 253-Z5B. 

Meichenbaum, D., Bowers, K. , & Ross, R. A behavioral a-^^lySpi^ ^^^^^^^^fLg 
expectancy effect. Journal of Personality and_ S^ociaj Psycholociy, 1869, 

13.. 306-316. 

Meyer, W. J.. & Thompson, G. G. Sex differences in the distribution of 

teacher approval and disapproval among sixth-grade children. Journal or 
Educational Psychology , 1956, 47^, 385-397. 

Orne, M. T. On the social psychology of the psychological experiment: With 
particular reference to demand characteristics and their implications. 
American Psychologist , 1962, 17_, 776-783. 

Overall, J. E. Multiple covariance analysis by the general least squares 
regression method. Behavioral Science , 1972, 17» 313-320. 

Overall, J. E., & Spiegel, D. K. Concerning least squares ana 1^^ of 
experimental data. Psychological Bulletin , 1969, 72^, ^^^-Ud, 

Panda, K. C, & Guskin, S. L. The effect of tutor's locus of control and 
expectancy information on tutor behavior and pupil perfornance. 
Final repo/t -submitted to the U.S. Office of Education, June, 1970. 

Parton, D. A., & Ross, A. 0. Social reinforcement of children's motor 
benavior: A review. Psychological Bulletin , i 965, 64, 65-/3. 

Pitt C C. V. An experimental study of the effects of teachers' knowledge 

'or incorrect knowledge of pupil IQ's on teachers' attitudes and practices 

anl pupils' attitudes and achievement. Unpublished doctoral dissertation, 
Columbia University, 1956. 



ERIC 



Rist. R. C. Student social class and teacher expectations: The 

self-fulfilling prophecy in ghetto education. Harvard Educational 
Review , 1970, 40, 4^.1-451. 

Rosenthal , R. Experimenter effects in be haviora l research. New York: 
Appl eton-Century-Crofts, 1965. 

Rosenthal. R. Experimenter expectancy and the reassuring nature of the 
null hypothesis decision procedure. Psychological Bulletin Monograph 
Supplement , 1968, 70, 30-47. 

Rosenthal, R. Empirical vs. decreed validation of clocks and tests. American 
Educational Research Journal , 1969, 6, 689-691. 

Rosenthal, R. Interpersonal expectations: Effects of the experimenter's 

hypothesis. In R. Rosenthal and R. Rosnow (Cds.), Artifact in behavioral 
research . New York: Academic Press, 1969. Pp. 182-277. (b) 

Rosenthal, R. On the social psychology of the self-fulfilling prophecy: 
Further evidence for Pygmalion effects and their mediating mechanisms. 
Unpublished manuscript, 1973, Harvard University. 

Rosenthal, R., & Jacobson, L. Pygmalion in the classroom. New York: 
Holt, 1968. 

Rosenthal, R., & Rubin, D. B. Pygmalion reaffirmed. In J. D. Elashoff 
and R. E. Snow (Eds.), Pygmalion Reconsidered . Worthington, Ohio: 
Charles A. Jones, 1971. Pp. 139-T5F: 

Rothbart, M., Dalfen, S., & Barrett, R. Effects of teacher's expectancy on 
student-teacher interaction. Journal of Educational Psychology, l9/t, 
62, 49-54. 

Rubovit<i, P., Maehr, M. Pygmalion analyzed: Toward an explanation of the 
Rosenthal-Jacobson findings. Journal of Personality and Social 
Psychology , 1971, 19, 197-203. 

Schwebe!, A., & Cherlin, D. Physical and social distancing in teacher-p^^ 
relationships. Journal of Educationar Psychology , 1972, 63, 543-550. 

Snow, R Unfinished Pygmalion. Contemporary Psychology , 1969, 14, 197-199. 

Staines, J. W. The self-picture as a factor in the classroom. British 
Jo irnal of E ducation , 1958, 28, 97-111. 

Stevens m, H. W., & Fahel , L. The effect of social reinforcement on the 
pe forma nee of institutionalized and noninstitutional ized normal and 
fe :bleminded children. Journal of Personality , 1961, 29, 136-147. 

Stevens m, H. W., & Hill, K. T. Use of rate as a measure of response in 
St jdies of social reinforcement. Psychological Bulletin , 1966, 66, 
321-326. 



39- 



Thorndike, R. L. Review of Pygmalion in the classroom . American 
Educational Research Journal , 196§7 57^08-711 . 

Thorndike, R. L. You have to know how to tell time. American 
Educational Research Journal , 1969, 6, 692. 

Winer, B. J. Statistical principles in experimental design. 
New York: McQraw-HilTrT962. 



